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Abstract: Myeloblastosis (MYB) is one of the largest transcribed factor 
families in plants. To gain an overall picture of the evolution of MYB 
genes in relict plants, we cloned nine novel MYB genes in Taxodiaceae 
plants ( Taxodium distichum, Taxodium ascendens, Cryptomeria japonica 
var. Sinensis , Cryptomeria japonica cv. Araucarioides, Cryptomer Ja¬ 
ponica , Metasequoia glyptostroboides, Cunninghamia lanceolata, Tai- 
wania cryptomerioides and Glyptostrobus pensilis). The deduced amino 
acid sequences for MYBs showed that the nine MYB proteins contained 
two DNA binding domains. The first domain is from amino acid position 
29 to 78, wherein three tryptophanes at 33, 53 and 73 were separated by 
19 amino acids, respectively. The second domain is from amino acid 
position 82 to 127, wherein three tryptophanes at 86, 105 and 124 were 
separated by 18 amino acids, respectively, whereas the first tryptophane 
at amino acid position 86 is replaced by a phenylalanine. The characteri¬ 
zation of these conserved domains at nine MYBs indicated that they all 
belong to the R2R3-MYB group. The secondary structure analysis 
showed that a-helix and p-tum are the major motifs of the predicted 
secondary structure of MYBs. The three dimensional model of each 
MYB protein showed that the structure is like clip, making it more flexi¬ 
ble and mobile. The similarities between the nine MYB proteins in 
Taxodiaceae were calculated. The highest identical value of 99% is be¬ 
tween CjsMYB, CjMYB and CjaMYB, whereas the lowest value of 82% 
is between TaMYB and C1MYB. According to the phylogenetic tree, the 
distances between different genera were relatively large whereas those 
within genera were relatively small. As expected, accessions of the same 
genus formed a subgroup before being grouped with other genera. 
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Introduction 

MYB (v-myb avian myeloblastosis viral oncogene homolog), 
one of the largest transcribed factor families in plants, participate 
in plant growth development and metabolic regulation (Moyano 
et al. 1996). Specifically, MYB contributes to a cell’s secondary 
metabolic processes by taking part in regulating the transcribed 
activity of phenylalanine ammonia-lyase gene (PAL). MYB can 
regulate cell morphology by activating particular materials syn¬ 
thesis in epidennal cells (Wang et al. 2003). They participate in 
stress response by inducing the transcription of genes in the 
ABA pathway (Abe et al. 1997). In forestry plants, the MYB 
genes help regulate the fonnation of the tiny tube and xylem by 
controlling the expressions of the cinnamoyl-CoA reductase gene 
and cinnamyl alcohol dehydrogenase gene (Goicoechea et al. 
2005). In addition, they control the fonnation of a cork layer by 
regulating the synthesis of lignin (Lopes et al. 2001; Patalaff et al. 
2003). 

Plant MYB proteins were classified into three major types: 
R2R3-MYB, with two adjacent MYB repeats domain; 
R1R2R3-MYB, with three adjacent MYB repeats domain; and a 
heterogeneous group collectively referred to as the MYB-related 
proteins, which usually but not always contain a single MYB 
domain (Rosinski and Atchley 1998; Jin and Martin 1999; 
Stracke et al. 2001). MYB transcription factors usually contain 
an incompletion repeat area of 51-52 amino acids, which include 
several highly conserved amino acids. The first feature within 
this conserved domain is several tryptophanes separated by ap¬ 
proximately 17-19 amino acids. This characteristic plays an 
important role during formation of hydrophobic groups in the 
protein spatial structure. Sometimes, the tryptophane in this posi¬ 
tion would be replaced by other aromatic amino acids or a hy¬ 
drophobic amino acid. Especially in the R2R3-MYB in plants, 
which contain two MYB domains, the first tryptophane in the 
second domain is usually replaced by leucine, isoleucine, or 

^ Springer 





796 


Journal of Forestry Research (2014) 25(4): 795-804 


phenylalanine. The second feature in this conserved domain is 
that some conserved amino acids exist flanking each tryptophane. 
For example, a cluster of acidic amino acids exist in the C ter¬ 
minal of the first tryptophane. Those conserved amino acids are 
required for the fonnation of a helix-helix-turn-helix structure, 
which is functionally important for the MYB transcription fac¬ 
tors (Bilaud et al. 1996; Braun et al. 1999; Yu et al. 2000). 

In the past decade, the MYB genes have been extensively 
studied (Chen et al. 2006; Zhang et al. 2011; Du et al. 2012). The 
fact that these genes exist widely in eukaryotes suggests that they 
may be very ancient genes. However, only a few members of this 
family have been discovered in gymnosperms (Bedon et al. 
2007). Despite recent large-scale gene discoveries for conifer 
trees such as pine and spruce (Kirst et al. 2003; Pavy et al. 2005), 
only a few regulatory gene families have been systematically 
studied in the gymnosperm class. One example is the Taxodi- 
aceae family, which are relict plants from Cretaceous period. The 
family was an important component in forest vegetation of the 
northern hemisphere from the late Cretaceous to the mid-tertiary 
eras approximately 115 to 30 million years ago. In the Tertiary 
and Pleistocene, however, the family underwent a widespread 
reduction resulting in the present-day relictual genera with re¬ 
stricted distributions (Sehlabraum and Tsuehiay 1984). At pre¬ 


sent, the Taxodiaceae consists of 10 arborescent genera: seven of 
which are monotypic, and the remainder are olictypic genera 
consisting of only two or three species, resulting in a wide range 
of morphological characteristics (Wu 1998). Up to now, there are 
few MYB gene was cloned and the characteristics of MYB genes 
within Taxodiaceae family is still unambiguous. As an important 
transcript in stress response, when the Taxodiaceae family went 
through the great changes of climate in the Tertiary and Pleisto¬ 
cene, MYB should play a role to cope with weather changes. 
Therefore, to study the evolution of MYB genes within this fam¬ 
ily would be necessary. 

Materials and methods 

Plant materials 

Fresh leaves of nine Taxodiaceae accessions representing six 
genera were collected from Hangzhou Plant Garden, Zhejiang, 
China (Table 1). The natural distribution of these six genera are 
extremely scattered, forming interrupted or isolated distribution. 


Table 1: Accessions used for Amplified Consensus Genetic Marker analysis 


Accession 

number 

Species 

genus 

Chromosome 

number 

Natural distribution 

i 

Taxodium distichum 

Taxodium 

2n=22 

North America 

2 

Taxodium ascendens. Brongn 


2n=22 

America 

5 

Cryptomeria japonica var. sinensis 


2n=22 

China 

6 

Cryptomeria japonica cv. Araucarioides 

Cryptomeria 

unknown 

Japan 

7 

Cryptomeria japonica 


2n=22 

Japan, China 

8 

Metasequoia glyptostroboides 

Metasequoia 

2n=22 

China 

9 

Cunninghamia Lanceolata 

Cunninghamia 

2n=22 

China 

10 

Taiwania cryptomerioides Hayata 

Taiwania 

2n=22 

Taiwan 

11 

Glyptostrobus pensilis 

Glyptostrobus 

2n=22 

China 


MYB genes clone 

RNAs were isolated from 500 mg of leaves from each accession, 
using the CTAB method with modification (Murray and 
Thompson 1980). They then were reversed into cDNA (coding 
DNA) as templates for PCR reactions. Primers of amplified con¬ 
sensus genetic markers targeted at the MYB gene (Lu et al. 2013) 
were used in the first PCR reaction. 

PCR was performed in 20 pL reactions containing 50 mg of 
template DNA; 0.5 mol/L of each target primer (see Table 2); 
200 mol/L of each dNTP; 1.5 mmol/L of MgCL; 1 unit of Pfu 
DNA polymerase; and 2 L of 10 X PCR reaction buffer. A touch¬ 
down PCR program (Don et al. 1991) was used: five min at 95°C; 
10 cycles of: 30 s at 95°C, 30 s at 58°C minus 0.3°C per cycle, 
one min. at 72°C; 20 cycles of: 30 s at 95°C, 30 s at 55°C, one 
min at 72°C; and seven min at 72°C for a final extension. The 
PCR products were cloned into a pGEMT-vector, and the posi- 
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tive clones were picked out. The T-vector then was identified by 
restriction endonuclease digestion. 


Table 2: Primers for target genes 


No 

Primer name 

primers sequence 



Forward primers (5'—>3') 

Reverse primers (5'—>3') 

1 

ACGM marker 

GAAATGCGAT- 

TGTGATGCAGGTCCA- 

target at MYB 

GAAAGGGAAA 

TTATGA 

2 

5'RACE 

CATCG- 

GAGCCTGCAACTC 

CCATTATCCTCCA- 

CATCAGCCACC 

3 

3'RACE 

GGTGGCTGATGTGGAG- 

GATAATGG 

GAGTTGCAGGCTCCG- 

ATG 


The correct clones were selected for sequencing. According to 
the sequencing results, 5' and 3' RACE (rapid amplification of 
cDNA ends) primers were designed (see Table 3). Following the 
RACE manual, the 5' and 3' fragments of MYB genes were ob¬ 
tained, respectively. 
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We then designed primers to clone coding sequence (CDS) of 
MYB genes from nine accessions, respectively. The primers for 
gene cloning are in Table 3. All primers used were synthesized 
by the Nanjing Jinsirui Biological Engineering & Technology 


Company (Nanjing, China). Electrophoreses of PCR products 
were run on 1.0% agarose gel. The target products were purified 
and cloned into pGEM-T vector (Promega, USA) for sequencing 
(Sangon, Shanghai, China). 


Table 3: Primers for cloning genes 


No. 

Gene name 

Accession genera 

primers for CDS cloning 

Forward primers ( 5'—>3') 

Reverse primers (5'—>3') 

i 

TdMYB 

T. distichum 

ATGAAAGGGAAAATCAGTA 

TCAGTAGATGGTGCACGATCC 

2 

TaMYB 

T. ascendens 

ATGAAAGGGAAAATCAGTA 

TCAGTAGATGGTGCACGATCC 

3 

CjsMYB 

C. japonica var. sinensis 

ATGAAAGGGAAAATCAGG 

TCAGTAGATGGTACAAGAGC 

4 

CjaMYB 

C.japonica cv. Araucarioides 

AT GAAAGGGAAAAGCAGG 

TCAGTAGATGGTACAAGAGC 

5 

CjMYB 

C. japonica 

AT GAAAGGGAAAAGCAGG 

TCAGTAGATGGTACAAGAGC 

6 

MgMYB 

M. glyptostroboides 

ATGAAAGGGAAAATCAGG 

TCAGTAGATGGTGCAGGATC 

7 

C1MYB 

C. Lanceolata 

AT GAAAGGGAAAAACAGG 

TCAGTAGATGGTGCAAGAGCC 

8 

TcMYB 

T. cryptomerioides 

ATGAAGGGGATGAACAGG 

TCAGTAGATGGTGCAGGAGCC 

9 

GpMYB 

G. pensilis 

ATGAAAGGGAAAATCAGG 

TCAGTAGATGGTGCAGGATC 


Note: The start and stop codons were underlined. 


Characterization of the deduced MYB protein 

Amino acid sequences of MYBs were deduced using DNAman 
software. The isoelctric points and the molecular weights of the 
deduced MYB polypeptides were calculated with the ProtParam 
tool (http://web.expasy.org/protparam). The conserved domains 
of MYBs were also predicted by SMART 
(http://smart.embl-heidelberg.de). Secondary structure of each 
MYB protein was checked in GOR (http://npsa-pbil.ibcp.fr). 
Finally, the three-dimensional model of each MYB was set up 
with 3D-pssm of ExPASy (http://en.wikipedia.org). 

Homologous analysis 

The nine MYBs from Taxodiaceae plants obtained in this study, 
as well as with MYBs from other plant species, including Hor- 
deum vulgare, Triticum aestivum. Solatium lycopersicum, Lolium 
temulentum, Medicago truncatula and Arabidopsis thaliana, 
were alignment by Clustal X. Phylogenetic tree was constructed 
using Mega 5. 


Results 

Analysis of MYB sequence 

Among the sequences, the start codon and termination codon 
were all obtained in those nine MYB nucleotides, respectively. 
Database search with BlastN at NCBI (www.ncbi.nlm.nih.gov) 
showed that the nine MYB nucleotide sequences had a high 
similarity (> 70%) with other MYB genes from both gymno- 
sperm and angiosperm plant species. Those results showed that 
the full coding sequences of nine MYB genes in Taxodiaceae 
plants were obtained, respectively (Fig. 1). The lengths of the 
nine MYB genes range from 1371-1392 bp (Fig. 2). The lengths 


of deduced amino acids range from 456 to 463 (Fig. 3). Further 
analysis showed that the deduced amino acid sequences of 
MYBs contained several variation sites. According to sequences 
of multiple alignments with Clustal X, serine deletion mutations 
were found at amino acid position 21 in MgMYB, GpMYB, 
C1MYB, and TcMYB, and continuous amino acid deletion at 256 
in TcMYB. Serine insertion mutations were found at amino acid 
position 225 in TdMYB and TaMYB, at 190 in C1MYB, at 341 in 
C1MYB and TcMYB, respectively. 


2000bp 

lOOObp 

750bp 

500bp 

250bp 

lOObp 



Fig. 1: PCR resules of 9 MYB genes 

Conserved domain analysis of the MYB protein 

MYB transcription factors usually contain an incompletion re¬ 
peat area of 51-52 amino acids, which include several highly 
conservative amino acids. In our study, the deduced amino acid 
sequences for MYBs showed that each MYB protein contained 
two DNA binding domains (Chen and Wang 2002). The first 
domain is from amino acid position 29 to 78, wherein three tryp¬ 
tophanes (W) at 33, 53 and 73 were separated by 19 amino acids, 
respectively (Fig. 3). The second domain is from amino acid 
position 82 to 127, wherein three tryptophanes (W) at 86, 105 
and 124 were separated by 18 amino acids, respectively, whereas 
the first tryptophane at position 86 is replaced by a phenylalanin 
(F). The characterization of these conserved domains at nine 
MYB indicates that they all belong to R2R3-MYB group. 
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AT GAAAGGGAAAA 

pgaaIgggaHa 

fgaabgggaHa 

tgaaSgggaHa 

AT GAAlGGGAmA 
AT GAaSgGGaHa 
r gaabgggaHa 
fgaaIgggaBa 





aabg 
aaIg 
cagBaaabg 
:agg.a.a.aag 

lAGjGAAAAG 
IAGGAAAAG 

caggaaaAg 

CAGGAAAAGA 


GaHcHc. 


cbtbtb 

lc i T l T i 

'cItStt 

CTTCTT 
' CfflT |t STj 

IC l T l T i 

CTTCTT 

C@T@TT 

CBTlTr 


:T T CT T C5*^gT C.AAAACCAGACAAT AATAT G 
CT T CT T cFJBgT CA_«A-ACCAGACAAT AATAT G 
: T T CT T cBt CAAAACCAGA.CAAT AATAT G 
:TTCTTI BSE t CAAAACCAGACAATAATAT G 
IT T CT T C^^Ht C.AAAACCAGACAAT AATAT G 
CTTCTT cFEBHt CA_AAACCAGA.CAAT AA.TAT G 
CTTCTT c UCTH t CAAAACCAGAjdAjST AAMAT G 
TAA0ATG 
TAATATG 


■GGGCCflTG 

Iggggccatgg, 

■GGGGC CAT GGj 
■GGGGC C'AT GG, 

■gggccIt g 
■gggccIt g 

■GGGGC CAT GG, 


GgCTiCg 

jGgCTGCg 

igBctIcB 

gHctgcB 


ATGAAATT 

ATGAAATT 

ATGAAATT 

ATGAAATT 

ATGAAATT 

ATGAAATT 

ATGAAATT 

ATGAAATT 

atgaaatt| 


TGGTGGCTTA 

TGGTGGCTTA 


TGGTGGCTTA GTGAAGGAGAATG 


TGGTGGCTTA 

TGGTGGCTTA 

TGGTGGCTTA 

TGGTGGCTTA 


TGGTGGCTTJ 


GT GAAGGAGAAT GGfl 
GT GAAGGAGAAT GGfl 


GT GAAGGAGAAT GG 
CT GAAGGAGAAT CG 
GT GAAGGAGAAT GG 
GT GAAGGAGAAT GG 


TGGTGGCTTA GTGAAGGAGAATGG 


GTGAAGGAGAATG 

m 


GAGGGGAACTGGAATT CT GT BCAGAABCAflT| 
GAGGGGAACTGGAAT T CT GTBCAGAABC, 
GAGGGGAACTGGAAT T CT GTBC AGAABC, 
GAGGGGAACT GGAAT T CT GTICAGAaIc, 
GAGGGGAACTCCAATTCTGTBCACAaSc, 
GAGGGGAACT GGAAT T CT GtScAGAaIc. 
GAGGGGAACTGGAATTCTGtIcAGAABC. 
GAGGGGAACT GGAAT T CT GtEcAGAaHc. 
GAGGGGAACT GGAAT T CT GtBcAGAaIcaBT| 


■CTGGTTTGTS 

IcTGGTTTGTS 

IcTGGTTTGTB 

IctggtttgtB 

IctggtttgtB 

[CTGGTTTGTS 

IctggtttgtB 

■CTGGTTTGTy 

IctggtttgtB 


TCGCT' 

TCGCT' 

TCGCT' 

TCGCT' 

TCGCT' 

TCGCT' 

TCGCT' 

TCGCT' 

TCGCT' 


GT 

GGAAAGAGjrjTC 

GT 1 

ggaaagagStc 

GT 

ggaaagagBtc 

GB 

ggaaagagStc 

GT 

^GAAAGAGT t 

Gt 

ggaaagagStc 

GT 

ggaaagagStc 

GT 

^gaaagagIt 

ggaaagagStc 


gagBtgca 


3GCTBCGAT GGBCBAACCAT ■GCGBCCBAAT CTgAAlAAGGGCGCCT T T T 
ggctIcgat ggbcSaaccat HgcgIccIa at ctBaaIaagggc GCCT T T T 
GGCT IcGAT GGBCBAACCAT HgCgHccSaAT CtHaABAAGGGCGCCT T T T C|C( 
ggctIcgat ggHcSaaccatHgcgSccHaat ctHaaIaagggcgcct t t t clc( 
ggctIcgat ggHcbaaccat HgcgIccIaat ctHaaIaagggcgcct t t t cicc 
ggctIcgat ggIcbaaccat HgcgIccIaat ctHaaIaagggcgcct t t t cicc 
ggctIcgat ggIcbaaccat HgcgIccIaat ctHaaIaagggcgcct t tt clc 
gcaggctHcgat ggbcHaacc atHgcgbccIaat ctHaaIaagggcgcct t t t clc 
Icgat ggIcIaaccatHBgcgIccHaat ctHaaIaagggcgcct t t t cicc 


Igaagagcgct t 

IgaagBgcgctt 

Igaagagcgct t 
Igaagagcgct t 

Igaagagcgctt 

Igaagagcgctt 
IgaagIgcgctIt 
IgaagIgcc 
Igaagagcc 


CAT BAT GGABCT GCAT CBCA 


CAT ATGGA 
CAT»ATGGA 
CAT ATGGA 
CAT ATGGA 
CAT ATGGA 
CAT ATGGA 


CAT ATGGA CTGCAT 


CAT ATGGA 


CTGCATC 

CTGCATC 

CTGCATC 

CTGCATC 

CTGCATC 

CTGCATC 


CTGCATC 


gBt BGGCAACAAGT 
gIt BGGCAACAAGT 
GBTBGGCAACAAGT 
GBT BGGCAACAAGT 
gIt BGGCAACAAGT 
gIt BGGCAACAAGT 
,g|t BGGCAACAAGT 
Iggcaacaagt 

GGCAACAAGT 


GGGCTCGAATGGCCGCCT T GAT GC CflGGGCGCACBGATAATGA 
GGGCTCGAAT GGCCGCCT T GAT GC CBGGGCGCAcIgATAATGA 
GGGCT CGAAT GGCCGCCT T GAT GCCBGGGCGCAcIgATAAT GA 
GGGCT CGAAT GGCCGCCT T GAT GCCBGGGCGCACBGATAAT GA 
GGGCT CGAAT GGCCGCCT T GAT GCCBGGGCGCACBGATAAT GA 
GGGCT CGAAT GGCCGCCT T GAT GCCBGGGCGCACBGATAAT GA 
GGGCT CGAAT GGCCGCCT T GAT GCCBGGGCGCACraGATAAT GA 
GGGCT CGAAT GGCCGCCT T GAT GCCraGGGCGCA cIgATAATGA 
GGGCT CGAAT GGCCGCCT T GAT GCcIgGGCGCAC0GAT AATGA 


TAAAGAAt 

TAAAGAAt 

TAAAGAAt 

TAAAGAAt 

TAAAGAAt 

TAAAGAAt 

TAAAGAAt 

TAAAGAAt 

TAAAGAAt 


TgT gg^Macacgaat c 

T T CT GGAACACAC GAATC 
TTCTGGAACACACGAATC 

tHtggSHacacgaatc 

CT^TG G^^A CACGAATC 
TTCTGGA0CACACGAATC 
t|tGGAACACACGAAT C 
GGAA0ACACGAATC 

gg^Hacacgaat C 


\gBaggat 

iGSAGGAT 

\gIaggat 

\G0AGGAT 

\gIaggat 

\GBAGGAT 

^gIaggat 

\gIaggat 

^gIaggat 


GCGCTTGGQ 

GCGCTTGGG 

GCGCTTGGG 

GCGCTTGGG 

GCGCTTGGG 

GCGCTTGGG 

GCGCTTGGG 

GCGCTTGGG 

GCGCTTGGG 


TTCCCT 

TTCCCT 

TTCCCT 

TTCCCT 

TTCCCT 

TTCCCT 

TTCCCT 

TTCCCT 

TTCCCT 


TACCCTGT TBT GAAGGC. 
taccctgttH GAAGGCA 
TACCCT GT tBB gaaggca E 
taccctgttS 


taccctgtt| 

taccctgttBt 


’GATGAAG 
’GATGAAGgGA 
GATG 

gaaggcaEt gat gaag 

TACCCT GT TTOSGAAGGCaHt GAT GAAG0GA 

taccctgttH gaaggcaHtgatgaag 
TACCCTGTTBT GAAGGCA0TGATGAAG 

GAAG 


0TGATGAA 


.GCTTGGG CtTTCCCTgTACCCTGTTc 


AGAATT T GGGCAGCCATflC 
AGAATT TBGBCAGCCAT Ic 
AGAAT T T BGBCAGCCAT flC 

Iagaat t tSgSc AGCCAT ic 

lAGAAT T T GGGCAGCCATTC 

Iagaatt t gggcagccattc 
IagaatttSgScagccat Sc 

]aGAATTT lGgCAGCCATTC 

AGAATTT0G 


PGTTGGAGAATTCGAACCgllGfSGTCTATAACAgTAATGATAACAgCAGT 
P GT fflGGpGAAT T CGAACcHGflGmCT AT AACAgT AAT GAT AAC A@CAGtHt 
:GT BGGBGAAT T CGAAC CCAGGGT C T ATAACAgTAATGATAACAACAGTAT T 
P GtIggIgAAT T CGAACCCAGGGT CT AIAACaStAATGATAACAAC AGT AT T 
PGtIggIgAATTCGAACcHg^HjT AT AACAgT AAT GAT AAC AACAGT ATT 

ccttggagaattcgaacccagggtctataacaHtaatgataacaacagtaTt 
? GtIggIgAATT CGAACCCAGGGTCTATAACAgTAATGATAACAACAGTATT 
P GtIggHgAAT T CGAACcHg^HcT AT AACABTAAT GAT AAC AACAGT AT T 


GGAGAGAGT TTTC 

gIagaHHgIttt! 

gbagaHgIt T TR 
gbagaHgbtttI 

GGAGAGAGT TTTC 

gbagaHgIt T TI 
gIagaHg|ttt| 
ggagagag 
agaHgStttI 


^TGTTi 

igtgttI 
IgtgTtI 
igtgttI 
igtgttI 
igtgt : I 
igtgttI 
igtgttI 
;gtgtt| 


■TCTTATGAGTTGATGA TAA A 
■TCTTATGAGTTGATGA TAA A 

Itcttatgagttgatgagtaa a 

■TCTTATGAGTTGATGA TAA A* 
■TCTTATGAGTTGATGA TAA A 
■TCTTATGAGTTGATGA TAAflA 
It CT T AT GAGT T GAT GA 
It CT T AT GAGT T GAT GA 
|t CT T AT GAGT T GAT G, 


TAATAATAATB 
TAATAATAATI 
TAATAATAATI 

ta.ata_ata.atS 

ta_ata_ata_atE 

taataataatI 

T AATAATAATI 

taataataatI 

taataataatI 


T GT GT T GAAAGAGT 
T GT GT T GAAAGAGT 
TGTGTTGAAAGAGT 
T GT GT T GAAAGAGT 
TGTGTT GAAAGAGT 
T GTGT T GAAAGAGT 
TGTGTTGAAAGAGT 
TGTGTTGAAAGAGT 
It GT GT TGAAA GAGT 


-jBCAAGGAAC 
sIcAAGGAAC 
-'ACAAGGAAC 
--ACAAGGAAC 
-A C AA G G AA C 
sIcAAGGAAC 
JBCAAGGAAC 
JcAAGGAAC 
CAAGGAAC 


T GACAGT AC T AGT G 
ST GACAGTAC T AGT G 
®TGACAGTACTAGTG 

I T GAMAGT AC T AGT G 
TGACAGTACTAGTG 
T GACAGT ACT AGT G 
TGACAGTACTAGTG 

I tact^ 
stgaSHtactI 




.TGATi 

.'■AT I 

.tgatI 

.tgatI 

.tgatI 

■AT I 

.tgatI 

atgatI 

,tgat| 


|AC AAGT T AGT 

Iacaagttagt 

Iacaagttagt 

Iacaagttagt 

Iacaagttagt 

Iacaagttagt 

ACAAGTTAGT 

Iacaagttagt 

ACAAGTTAGT 


GGBAGCAC CAAT AACAC T A 

ggIagcaccaataacacta 
ggIagcaccaataacacta 
ggIagcaccaataacact^ 
ggIagcaccaataacacta 
GgIaGC ACC AATAACACT A 
GgIaGC AC C AAT AAC AC T A 
GGHAGCACCAATAACACT A 
GGBAGC AC C AAT AAC AC T A. 


AGTGT CAGTAGTAGTA 
AGT GT CAGT AGT AGT A 
AGTGTCAGTAGTAGTA 
AGT GT CAGT AGT AGT A 
AGTGTCAGTAGTAGTA 
AGTGTCAGTAGTAGTA 
AGT GT CAGT AGT AGT A 
AGT GT CAGT AGT AGT A 
AGT GT CAGT AGT AGT A 


0 T AAT AGT A 
It AAT AGT A 
It AAT AGT A 

StAATAGTA 
StAATAGTA 
StAATAGTA 
StAATAGTA 
StAATAGTA 


j AATAA 

tcattHctactctgtaaa 

gggg 

AATAA 

tcattHctactctgtaaa 

GGGG 

AATAA 

tcattHctactctgtaaa 

GGGG 

i AATAA 

tcattHctactctgtaaa 

GGGG 

AATAA 

tcattHctactctgtaaa 

GGGG 

:AATAA 

tcattHctactctgtaaaI 

GGGG 

AATAA 

TCATTHCTACTCTGTAAA 

GGGG 

AATAA 

tTOttHctactctgtaaa 

GGGG 

|AATAA 

3cBtt@ctactctgtaaa 

GGGG 












































































Journal of Forestry Research (2014) 25(4): 795-804 



CAGGCCTCAB 




S CA S 


BTjGTC 

ttHtItc 

ttHtItc 

ttmt^tc 

ttHtItc 

ttHtItc 

ttHtItc 


ggai 

5CT CCCT T CAAGCC 

: AATfflAGTGT T GT cIt T AC 


5rrr at. 

GGAC 

5CTCCCTTCAAGCC 

-■- 

:aatbagtgt t gt c 

TTAC 


CCCAC. 

GGAC 

xri’prr’TTr a Ar;r*r 

iaatbagtgt t gt c 

TTAC 


CCCAC. 

GGAC 

SCTCCCTTCAAGCC 

:aatHagt gt t gt c 

TTAC 


rrrAr 

GGAC 

SCTCCCTTCAAGCC 

: aatBagt gt t gt c 

TTAC 


IrrrAG 

CTAf 

SCTCCCT T CAAGCC 

:aatHagtgttgtc 

TTAC 

SACACCAcj 

Irrr AG 

(TAC 

SCTCCCTTCAAGCC 

; aatHagt gt t gt c 

TTAC 

SACAHCACg 

CCCAC. 


SCTCCCTTCAAGCC 

: AAT Hagt gt t gt c 

TTAC 

sacaHcacJ 

CCCAC. 


SCTCCCTTCAAGCC 

: aatHagt gt t gt c 

TTAC 

sacaIcac! 

ICCCAC. 



rATCTT^x«g 
CATCTTAT11 
CATCTTAT11 
CAT CT TAT I S' 

zatcttatIJ 

zatcttatKt| 

zatcttatH' 

CATCTTAT^S' 

iatcttatBb' 



BTTTGGGTGTA 
fflTTTGGGTGT/k 
■TTTGGGTGT^ 
■ttTGGGT GTA 
iTTTGGGT GTA 

■tttgggtgt^ 
Itttgggtgta 

BT T T GGGT GT A 
Htttgggtgt^ 


CT T CT GGATGCACT 
CT T CT GGAT GCACT 
CT T CT GGAT GCACT G 
CT T CT GGAT GCACT G 
CT T CT GGAT GCACT G 
CT T CT GGAT GCACT G 
CT T CT GGAT GCACT G 
CT T CTGGAT GCACT G 
CT T CT GGAT GCACT G 


GGGAGAGA Ca 


TGCTG 

TGCTG 

TGCTG 

TGCTG 

TGCTG 

TGCTG 

TGCTG 

TGCTG 

TGCTG 


_ f:IiHB 

■aItctttcaag 

HaItctttcaag 

TAT TCT T TCAAGl 
T AT T CT T T CAAGl 
TATTCT T T CAAG 
TAT TCT T T CAAG 
IaIt CT T TCAAGl 

IaBtctttcaag| 

■aItctttcaag 


VATAAT ATI 

vataatat! 

ataatatI 

ataatatI 

ataatatI 

A AT Ah TAT I 

aat aat at I 
■ataatatI 

AATAATATl 

TJX 


GATATGA TTAT 




GAT AT GAB 
TGATATGAB 

tgatatga! 

tgatatga! 

tgatatga! 

tgatatga! 

tgatatgaT 

TGATATGAB' 


TTAT.fi 
TTAT.fi 
TTATA 
TTAT fi 
TTAT fi 
TTATA 
TTAT^ 
TTATA 


ABTTGTAC 

aIttgtac 

ABTTGTAC 

aIttgtac 

aIttgtac 

ABTTGTAC 

aIttgtac 

a|ttgtac 

aBttgtac 


lGTTmATGATG 

lGTtKaTGATG 

lGTtBHaTGATG 

.GTtHaTGATG 

lGTtKaTGATG 


Eat gat g 

^ATGATG 
ATGATG 
ATGATG 


ATCT 

ATCTGG 

ATCTGG 

ATCTGG 

ATCTGG 

ATCTGG 

ATCTGG 

ATCTGG 

ATCTG< 


CTATATA 
GACTATATA 
GACTATATA 
GACTATATA 
GACTATATA 
GACTATATA 
GACTATATA 
GACTATATA 
GACTATATA 


A.T GGAT ATGCCBGCT GAAAT GGT T C T C 


AT GGAT AT GC C 
AT GGAT AT GC C 
AT GGAT AT GC C 
AT GGAT AT GC C 
AT GGAT AT GC C 
AT GGAT AT GC C 
AT GGAT AT GC C 
ATGGATATGCC 


GCTGAAATGGTTCTCA 
GCT GAAAT GGT T CT CA 
GCT GAAAT GGT T CT CA 
GCT GAAAT GGT T CT CA 
GCTGAAATGGTTCTCA 
GC T GAAAT GGT T CT CA 
GCT GAAAT GGT T CT CA 
GCT GAAAT GGT T CT CA 


SAACCAT T CTTAGAATAj 
SAACCAT T CTTAGAATAj 
5AAC CAT T C T T AGAAT a| 
14AAC CAT T CT T AGAAT a| 
ISAAC CAT T CT T AGAAT A| 
SAAC CAT T C T T AGAAT A 
GAAC CAT TjCTT AGAAT a| 
GaBcBBtBc T T AGAAT Aj 
IAACc BEC c T T AGA.fi T Al 

me 


CBTGAATGGT 

cItgaatggt 

CBTGAATGGT' 
CBTGAATGGT' 
CBTGAATGGT' 

cItgaatggt' 
JCBTGAATGGT' 
jgTSclTGAATGGT 
gc0T GAAT GGT 

m 


■ctcccatcaacactBt^Bc 
BCT CCCAT CAACACT BT^Hc 

IctcccatcaacactIt^Hc 

!cT CCCAT CAACACTBT^Mc 
BCT CCCAT CAACACTBT^Hc 
BCT CCCAT CAACACTBT^Hc 
GCA|AGCC|CT CCCAT CAACACTBT^Hc 
CTCCCATCAACACTBtB 
CTCCCATCAACACT0tR 


sA.T T CT GBCA.T I 

cctgBcat| 

P CTGSC.fi 
:tgHcat| 




fi-AAGCTGC 
AAGCTG 
“AAGCTG 
AAGCTG 
AAAGc|t G 
-AAGCTG 
AAGCTG 


CTTCCTAGAGCTG 

CTTCCTAGAGCTGC 

CTTCCTAGAGCTG 

CTTCCTAGAGCTGC 

CTTCCTAGAGCTGC 

CTTCCTAGAGCTG 

CTTCCTAGAGCTG 

CTTCCTAGAGCTGC 

CTTCCTAGAGCTG 


AT ACS'_ 
ATAC0G 
ATAC0G 
ATACgC 
--fiTACgcB 


GTAT GGCCATATATAAT T T GGT GAATA 
GT AT GGCCAT AT AT AAT T T GGT GAAT A 
GTATGGCCATATAT AAT T T GGTGAAT A 
GT AT GGCCAT AT AT AAT TT GGT GAAT A 
GTAT GGCCATATATAAT T T GGT GAATA| 
GTAT GGCCATATATAAT T T GGT GAATA 
GTATGGCCATATAT AAT T T GGTGAATA 
GT AT GGCCAT AT AT AAT T T GGT GAAT A 
GT ATGGCCAT AT AT AAT T T GGT GAAT A| 


& 




















rrr Jr * 




A 

Sca|cacc 

AATACATAA 

CAT 

gggg!r!ga 

TGTGGG 

GT GGAACAACATGCCT GGT GCCT GT CAgIt AAgIgAcItACACT CCACT CT 

GAT 

A 

IcABCACC 

AATACATAA 

CAT 

ggggIga 

tgtGgg 

GT GGAACAACAT GCCT GGT GCCT GT CAGBT AAGBGAcHtACACTCCACT CT 

GAT 

fi 

IcaIcacc 

AATA.CATAA 

CAT 

GGGGBGA 

TGTGGG 

GT GGAACAACAT GCCT GGT GCCT GT CAGITAAgHgACHTACACTCCACT CT 

GAT 

A 

IcaIcacc 

AATA.CAT.AA 

CAT 

GGGGBGA 

TGTGGG 

GT GGAACAACAT GCCT GGT GCCT GT CAgIt AAGBGAcItACACTCCACT CT 

GAT 

A 

raCA0CA.CC 

AATA.CAT.AA 

ESt 

GGGGBGA 

tQtggg 
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;tIt 
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ST ACT 
ST ACT 
STAGjT 
ST ACT 
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GCAACTC 

GCAACTC 

GCAACTC 

GCAACTC 

GCAACTC 

GCAACTC 

GCAACTC 
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CATCTACTGA 
CATCTACTGA 
CATCTACTGA 
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CAt T C T AGaAC cAGGAC T AAT cT C a GT a cT GCAACT GGCcT ACAgAACAt CAAGGG TC TG ACCAT CTACT GA 

Continued Fig. 2: Multiple alignment of 9 MYB genes 
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erfimdlhhklgnkwarmaalmpgrtdneiknfw|trikrrmrlglslypvleg 1 de|knlgshsvgefe 
E RF IM|lh|KLGNKWARMAALM PGRT DNEIKNYWNTRIKRRMRLGL S LY PVflE GiJjDE GKNlJs H S VGE F E 
E r|iMDLHHK|gNKWARMAALM PGRT DNEIKNYWNT RIKRRMRLGL S LY PVLE GSDE GKNLGS hBvGE F E 


PRVYNNN DM S 3 LGE S F GVSY 

prvyn§ndn|slgBsfgvsy 
prvyntndnnsige!f|vsy 
PRVYNTNDNN SIGE S F GVSY 
PRVYNTNDNNSIGESFGVSY 
PRVYNTNDNNSIGESFGVSY 
PRVYNTNDNNSIGESFGVSY 
p BB yN TNDNN SIGESF GVSY 
P GYYnBs Me S F GVSY 


ERFIMDLHHKLGNKWARMAALMPGRTDNEIKN 5 WITRIKRRMRLGLSLYPVLEGSDEGKNLGSHSVGEFE PRVYNTNDNNS 6 GE SFGVSY 


elmsntnnnMcvervtrntdstsaqmiqvsgstnntIsvsssssnsnnnh 

E LM SNTNNNmCVE RVT RNT D S T S AQMIQVS GS TNNtIs VS S S S SN SNNNh| 
E LM SNTNNnHcVE RVT RNT D S T S AQMI QVS GS TNNT|S VS S S S 
E LM SNTNNNBcVE RVT RNT D S T S AQMIQV S G S TMM T K S V S S S S 
E LM SNTNNnHcVE RVT RnBd S T SaMmIQVSGSTNNTN S VS S S S 
E LM SNTNNnHcVE RVT RN|D S T S A|MI QVS GS TNNTN S VS S S S 
E LM SNTNNnHcVE RVT RnJd STS AgMI QVSGSTNNTNSVSSSS 

IQVSGSTNNTNSVS S S S 
IQVSGSTNNTNSVS S ST 


E LM SNiNNNNNCVE RVT RNT d|t| 

” [ftsli - " " 


h|n 


SVNGAg 

svngaI 
nsnnnhSysvngaI 

NSNNNHgYSVNG a| 
NsBnNHCYSVNGAS 
nsInnhcysvngas 

NSgNNHCYSVNGAS 

NSNNN 0 CYSVNGAS 

nsBnHcysvngas 


SGLISELPS SQSVLSLDTT 
SGLISELPS SQSVLSLDTT 
SGLISELPSSQSVLSLDTT 
SGLISELPSSQSVLSLDTT 

elpssqBvlsldtt 

ELPSSQHVLSLDTT 

[elpssqBvlsldtt 
sglBselpssqsvlsldst 

SGlIsELPSSQSVLSLDTT 


PTSY|YSFKLLS| 

PTSYfflSFKLLS 

ptsyBysfklls 

PTSYgYSFKLLSl 
PTSYDYSFKLMs| 
PT SYDYSFKLMS 
PTSYDYSFKLMS 
PTSYNYSFKLLS 
PT SYNYSFKLLSl 


r SAQMIQVSGSTNNTNSVS SS 3 SNSNNNHCYSVNGASSGLISELPS SQSVLSLD 3 T PTSY 1 YSFKL 6 S 


MF GCT S S GLLDALLLQD S HNNIDMNYT QE WGE T KT E LDDVS GDYIIT 

egeBRmfgctssglldalllqdshnnidmnytqewgetktelddvsgdyiBt 

E GeHnF GCT S S GLLDALLLQD S HNNI DMNYT QE WGE T KT E lBdVS GDYI IT 
E GENjNFGCT S S GLLDALLLQD S HNNI DMNYT QE WGE T KT E LgDVS GDYI IT 
EGE S SNFGCT S SGLLDALLLQDsBnNIDMNYTQEWGETKTELDDVSGDYIIT 
E GE S SNF GC T S S GL L D AL L L Q D S DNNIDMM Y T Q E WGE TKTELDDVS GDYI IT 
E GE S SNFGCT S S GLLDALLLQD SgNNI DMNYT QE WGE T KT E LDDVS GDYI IT 
EGESSNFGCSST GLLDALLLQD S HNNI DMflYT QE WGE T KT eBnDVS GDYI IT 
E GE SijNF GC|S S GLLDALLLQD S HNNI DMNYT QE I VGE T KT E LNDBs GDYI IT 


SAT P PMDMPAEMVLTS Hg 
sat|pmdm paemvlt sh! 
SAT P PMDMPAEMVLT S hS 
SAT P PMDMPAEMVLT S HE 
SAT P PMDMPAEMVLS S hS 
SAT P PMDMPAEMVL S S HE 
SAT P PMDMPAEMVL S S HE 


EMLRETAKEVKIEPFL 
EMLRETAKEVKIE PFL 
EVLRETAKEVKIE PFL 
EMLRETAKEVKIEPFL 
EMLRETAKEVKIEPFL 
EMLRETAKEVKIEPFL 

emlretakevkIepfl 


SAT P PM DM PAEMVlBs HONOWEmMe T a|e VKI e|yl| 

satppmdmpaemvlsshqnqwemlretakevkiepF 


EGESSNFGC 3 S 3 GLLDALLLQDSHNNIDMNYTQE 6 VGETKTEL 1 DVSGDYIITSATPPMDMPAEMVL 3 SHQNQWE 6 LRETAKEVKIEP 5 L 


|eyiddsd@lpewckaspintlsnsaikesctlpraat| 

IeyiddsdllpewckaspintlsnsaikesctlpraatI 
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Fig. 3: Multiple alignment of MYB protein. Note: Conserved domains are underlined. 


(Table 4). The numbers of a-helix and )5-turn among nine MYBs 
Prediction of MYB protein range from 7 to 10 and 19 to 25, respectively. The variations of 

secondary structure between the nine MYBs are resulted from 
The secondary structures analysis showed that a-helix and (S-turn the amino acid sequences variations. Mutation of serine (S) to 

are the major motifs of predicted secondary structure of MYBs phenylalanine (F) at amino acid position 64 in MgMYB, C1MYB, 
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and GpMYB result in a (3-turn insertion in these positions, re¬ 
spectively. Whereas, mutation of leucine (L) to serine (S) at po¬ 
sition 78 in TaMYB, as well as alanine (A) to serine (S) at posi¬ 
tion 75 in CjsMYB, resulted in a a-helix deletion in these posi¬ 
tions, respectively. Although some variations exist among the 
nine MYB proteins, their secondary structures are similar, which 
showed the conservation of MYBs in evolution. The 
three-dimensional model of each MYB protein was set up with 
3D-pssm of ExPASy (Fig. 4). The two DNA binding domains of 
MYBs can be seen clearly. The structure of each MYB is like a 
clip, resulting in the MYB transcription factors having more 
flexibility and mobility when binding target DNA. 

Homologous analysis 

Database search with BlastP in NCBI and multi-alignment 
Clustal X 1.83 showed that the deduced MYB proteins had simi¬ 
larity (from 74% to 78%) with MYBs from other plant species 
(Fig. 5). The similarities among MYB proteins we obtained in 
Taxodiaceae were calculated. The highest identical value of 99% 


is between CjsMYB, CjMYB and CjaMYB, whereas the lowest 
value of 82% is between TaMYB and C1MYB. The hylogenetic 
tree was constructed with Mega5 (Fig. 6). According to the tree, 
the distances between different genera were relatively large 
whereas those within genera were relatively small. As expected, 
accessions of the same genus fonned a subgroup before being 
grouped with other genera. 


Table 4: Results of sequence analysis 


No. 

Gene 

Name 

Length of 
CDS (bp) 

Length of Molecular 
amino acid Weight (Da) 

PI 

Numbers 

of a-helix 

Numbers 
of (3-turn 

1 

TdMYB 

1392 

463 

51061.6 

5.54 

8 

25 

2 

TaMYB 

1392 

463 

50911.3 

5.67 

7 

25 

3 

CjsMYB 

1389 

462 

51318.9 

5.22 

7 

24 

4 

CjaMYB 

1389 

462 

51277.9 

5.28 

8 

24 

5 

CjMYB 

1389 

462 

51302.9 

5.22 

8 

23 

6 

MgMYB 

1386 

461 

51202.9 

5.89 

8 

24 

7 

C1MYB 

1392 

463 

51824.3 

5.63 

9 

25 

8 

TcMYB 

1371 

456 

50827.5 

5.85 

10 

19 

9 

GpMYB 

1386 

461 

51250.0 

5.80 

8 

25 



Fig. 4: Predict the three-dimensional of MYB protein. Note: The number one to nine correspond to the accession number in Table 1. 
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Fig. 5: Multiple alignment of MYB protein among different species: Hor- 
deum vulgare (X87690.1), Triticum aestivum (AY615200.1), Solanum ly- 
copersicum (NM OO 1247433.1), Lolium temulentum (AF114162.1), Medi- 
cago truncatula (XM 003589214.1), Arabidopsis thaliana (NM_118827.1); 
Numbers correspond to accessions in Table 1. 
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Fig. 6 : Phylogenetic tree of MYB protein 


Discussion 

To achieve an overall picture of this gene super-family in relict 
plants, we have gained nine accessions of R2R3-MYB genes in 
the Taxodiaceae family. In the past decade, the R2R3-MYB 
genes have been extensively studied. They were reported to be 
involved in many physiological and biochemical processes. The 
members in this group are numerous. Chen et al. (2006) identi¬ 
fied 198 genes in the MYB superfamily from an analysis of the 
complete Arabidopsis genome sequence; among them, 126 
(63.6%) are R2R3-MYB genes. However, bio-information on 
Taxodiaceae plants is limited and the genome of these plants 
remains unknown. We designed primers according to a reserved 
sequence of MYB domain and obtained MYB genes from nine 
accessions of the Taxodiaceae family. The nine MYB genes all 
belong to R2R3-MYB, which indicated that R2R3-MYB is the 
main type in these plants. The R2R3-MYB group of transcription 
factors is one of the largest regulatory gene families known in 
plants (Cedroni et al. 2003). The MYB domain is well conserved 
among plants, yeast and animals (Lipsick 1996). Therefore the 
origin and evolutionary history of members of this large gene 
family are of interest. Similarities in the nine MYB proteins ob¬ 
tained in Taxodiaceae also closely corresponded with MYBs 
from other plant species, implicating the evolutionary conserva¬ 
tion of MYB gene. The conserved domains of R2R3-MYB in 
Taxodiaceae plants are exactly the same as the domains that are 
necessary for MYB functions. According to multiple sequences 
alignment of the nine MYB genes, the amino acids sequences of 
T. cryptomerioides and C. lanceolata showed more differences, 
mainly with insertion and deletion mutations. For example there 
are four continuous (SSSS) inserts at the amino acid position 
17-20, and four amino acids (QNQW) insert mutations at posi¬ 
tions 341-344. These variations made T. cryptomerioides and C. 
lanceolata cluster into one group and showed that the two spe¬ 
cies have very similar genetic relationship. 
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